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' For time-to-event data with finitely many competing risks, the 

, proportional hazards model has been a popular tool for relating the 

cause-specific outcomes to covariates [Prentice et al. Biometrics 34 

I I ' (1978) 541-554]. This article studies an extension of this approach to 

, allow a continuum of competing risks, in which the cause of failure 

' is replaced by a continuous mark only observed at the failure time. 

(-H ^ We develop inference for the proportional hazards model in which the 

■ regression parameters depend nonparametrically on the mark and the 
3 ' baseline hazard depends nonparametrically on both time and mark, 
y , This work is motivated by the need to assess HIV vaccine efficacy, 

' while taking into account the genetic divergence of infecting HIV 

I I viruses in trial participants from the HIV strain that is contained in 

^ , the vaccine, and adjusting for covariate effects. Mark-specific vaccine 

' efficacy is expressed in terms of one of the regression functions in 

QQ , the mark-specific proportional hazards model. The new approach is 

■ evaluated in simulations and applied to the first HIV vaccine efficacy 
O ' trial. 

cn ; 

■ 1. Introduction. It has been 30 years since Prentice et al. [14] intro- 
I duced a Cox regression framework for tlie analysis of failure time data in 

^ ' the presence of finitely many competing risks. Yet many important appli- 

• rH . cations of competing risks methodology involve continuous causes-of-failure 

rS I (marks). In HIV vaccine trials, for example, genetic divergence of infecting 
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HIV viruses from the HIV strain represented in the vaccine needs to be 
taken into account to properly assess vaccine efficacy, but the mark variable 
is essentially continuous because of the large number of mutations involved. 
Other examples of continuous mark variables include lifetime medical cost 
or a quality of life score associated with survival time [13]. The grouping 
of continuous mark data into discrete marks is unsatisfactory because that 
amounts to a coarsening of the data and the results will depend on the way 
the groups are defined. To address this problem, we develop inference for a 
proportional hazards model in which both the regression parameters and the 
baseline hazard function depend nonparametrically on a continuous mark. 

The paper is motivated by the need for new methods to analyze data 
from HIV vaccine efficacy trials. Approximately 15,000 new HIV infections 
occur each day [20], making development of a protective HIV vaccine a top 
priority for biomedical science. In efficacy trials thousands of HIV-negative 
volunteers are randomized to receive vaccine or placebo, and are monitored 
for HIV infection. Five efficacy trials have recently been conducted. A pri- 
mary objective of each trial is to assess vaccine efficacy (VE) to prevent 
infection, where typically VE is defined as one minus the hazard ratio (vac- 
cine/placebo) of HIV infection. One of the greatest barriers to achieving 
an efficacious vaccine is the extreme genetic heterogeneity of HIV [6, 11]. 
Although it may be possible to develop a vaccine that protects against HIV 
strains genetically similar to the HIV virus or viruses represented in the vac- 
cine, it may be quite difficult to develop one to protect against HIV strains 
dissimilar from the vaccine material. This phenomenon is well known for 
flu vaccines — moderate genetic mismatch between an exposing flu virus and 
the virus represented in the vaccine causes vaccine failure, which has necessi- 
tated development of a new vaccine each year that is closely matched to the 
contemporary circulating flu strains. The genetic divergence (or distance) 
between two aligned HIV sequences can be measured as the weighted per- 
cent mismatch of amino acids, and since this distance may be unique for all 
infected subjects, it is natural to consider it as a continuous mark variable. 
The formidable problem of HIV genetic diversity implies that an important 
objective of an efficacy trial is assessment of if and how VE depends on the 
genetic divergence. 

This problem can be addressed in terms of the conditional mark- specific 
hazard function, defined as 

X{t,v\z)= lim P{T £[t,t + hi), 

(1) 

V e[v,v + h2)\T>t,Z{t) = z}/hih2, 

where T is the failure (infection diagnosis) time, y is a continuous mark vari- 
able and Z{t) is a (possibly time-dependent) p-dimensional covariate. Huang 
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and Louis [7] developed the nonparametric maximum likelihood estimator of 
the joint distribution of T and V in terms of the unconditional mark-specific 
hazard function. Gilbert, McKeague and Sun [5] defined mark-specific vac- 
cine efficacy as VE(t,f) = 1 — A(t,t'|l)/A(i,f |0), with z being the indicator 
of membership in the vaccine group; they developed several nonparametric 
and semiparametric tests concerning VE(t,f). 

In this article, we develop the mark-specific proportional hazards (PH) 
model 

(2) A(t, v\z{t)) = Ao(t, v) exp{/3(t;)^z(t)}, 

where the baseline hazard function Ao(-,f) and the p-dimensional regression 
parameter [3{v) are unknown continuous functions of v. As far as we know, 
this model has never been studied in the literature, even though it is closely 
related to the discrete cause-of-failure models discussed by Prentice et al. 
[14]. The approach in the continuous case departs from the discrete case in 
that it is necessary to "borrow strength" from data in a neighborhood of v, 
with the data closest to v contributing the most. 

For the HIV vaccine trial application, we partition the covariate as z{t) = 
{zi, Z2{t))'^ , where zi is the treatment (vaccine) group indicator and Z2{t) 
is a vector of possibly time-dependent covariates. Then the vaccine efficacy 
defined above takes the simpler form V"E{v) = 1 — exp{Pi{v)), without any 
dependence on t. By assuming proportional hazards, model (2) can provide 
more powerful tests of mark-specific vaccine efficacy than the nonparamet- 
ric procedures of Gilbert, McKeague and Sun [5], and the model allows 
adjustment for covariate effects. Furthermore, ignoring the mark variable 
and studying vaccine efficacy using the standard Cox model, as is widely 
practiced in vaccine trials for many infectious diseases, can give misleading 
results. In fact, even in the case of model (2) with z as the treatment in- 
dicator, the ordinary (marginal) Cox model will be misspecified unless the 
baseline Xo{t,v) factors into separate functions of t and v. 

Indeed, consider the model A(t, v\z = 0) = 7o/2 + 'yitv and \{t, v\z = 1) = 
7o^^ + litv"^ : for t > 0, < f < 1, z G {0, 1}. The corresponding marginal 
hazard functions are \{t\z = 0) = 7o/2-|-7it/2 and \{t\z = 1) = 7o/2 + 7if/3, 
for t > 0. It is clear that \{t\z) is not a proportional hazards model unless 70 
or 7i is zero. If 71 = 0, the resulting marginal hazards become proportional 
for z = and z = 1. However, in this example, the marginal vaccine efficacy 
VE = 1 — X{t\z = 1)/A(t|2; = 0) = while the mark-specific vaccine efficacy is 
VE(f ) = 1 — 2v. The ordinary Cox model averages the mark-specific vaccine 
efficacy over its range, and important vaccine effects may be missed. This 
issue will be further illustrated in our simulation study. In general, use of 
the ordinary Cox model for studying hazard ratios can be misleading if an 
important mark variable is ignored. The mark-specific PH model offers a 
way to correct for that deficiency. 
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We also consider a cumulative vaccine efficacy estimand defined as CV(f ) = 
J^ YE(u)du where a > 0. We develop distribution-free uniform confidence 
bands for CV(f), which are useful for inferential purposes. In addition we 
derive test statistics for evaluating mark-specific vaccine efficacy based on 
the estimator of CV{v). 

The paper is organized as follows. Section 2 develops a local partial likeli- 
hood procedure for estimating f3{v), leading to the construction of pointwise 
confidence intervals and formal tests for various hypotheses of interest con- 
cerning vaccine efficacy. A simulation study evaluating the performance of 
the proposed tests and the pointwise and simultaneous confidence inter- 
vals for VE{v) and CV(u) is presented in Section 3. The proposed methods 
are applied to analyze the data from the first HIV vaccine efficacy trial in 
Section 4. We discuss some general aspects of mark-specific PH models in 
Section 5. Proofs of the main results are placed in the Appendix. 

2. Mark-specific proportional hazards model. 

2.1. Local partial likelihood. We begin by stating some assumptions and 
notations that are used throughout the paper. The mark variable V is as- 
sumed to have a known and bounded support; rescaling V if necessary, 
this support is taken without loss of generality to be [0,1]. The obser- 
vations {Xi,Si,6iVi,Zi), i = 1, . . . ,n, are assumed to be i.i.d. replicates of 
{X,6,6V, Z), where X is the right-censored failure time corresponding to 
T, which satisfies the model (2), and 6 is the indicator of non-censorship. 
The mark is assumed to be observed whenever the corresponding failure 
time is uncensored; when 6i = 0, Vi is undefined and is not meaningful. 
The censoring time is assumed to be conditionally independent of (T, V) 
given Z. 

We consider a localized version of the log partial likelihood function for 
/3 = I3{v) at a fixed v. 



where Kf^[x) = K{x jK) jh^ ^i') is a kernel function with support [—1, 1], r 
is the end of the follow-up period and h = hn is a bandwidth. Here Yi[t) = 
I{Xi > t) and Ni{t, v) = I{Xi <t,6i = 1, Vi < v) is the marked point counting 
process with a jump at an uncensored failure times Xi and the associated 
mark Vi. For background on marked point processes see Bremaud [2] and 
Martinussen and Scheike [10]. 

The log partial likelihood function (3) resembles that of Kalbfieisch and 
Prentice [8] in the case of discrete marks, except that it borrows strength 



(3) 




X Ni{dt,du), 
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from observations having marks in the neighborhood of v. The kernel func- 
tion is designed to give greater weight to observations with marks near v 
than those further away. The local maximum partial likelihood estimator of 
f3{v) is a maximizer f3{v) of (3). A similar approach has been studied by 
Cai and Sun [3] for estimating time-dependent coefficients in Cox regression 
models. 

Denote Hj = J uU-C {u) du, uj = J u^K'^{u) du for j = 0, 1, 2. For (3 G M^, 
t > 0, let 

n 

S(^)(i,/3)=n-^5^F,(t)exp{/?^Z,(t)}Z,(t)^^ 

i=l 

where for any z ^W, we denote z*^" = 1, z®^ = z and z®^ = zz'^ . Define 
s^^\t,[3) = ES^^\t,P) and 

'Jn[t,P) 5(0) (i^^) \S(0){t,P) 



Taking the derivative of l(v,(3) with respect to /? gives the score function 

U{v,P)=l'f,{v,l3) 



(4) 

n rl rr r S^^\t,P) 



i^^Jo Jo 



Zi{t) 



SW{t,P) 



Ni{dt,du). 



The maximum partial likelihood estimator is a solution to U{v,P{v)) = 
0, and can be computed using a Newton-Raphson algorithm. The second 
derivative of l{v,(3) with respect to (3 yields 



I'f,{v,f3) = -J2 t r Khiu-v)Jnit,f3)Ni{dt,du). 
,=1^0 Jo 



Although inference on /3 is usually of primary interest, the baseline func- 
tion Ao(t,w) can also be estimated, by smoothing the increments of the 
following estimator of the doubly cumulative baseline function Ao(t,f) = 
lolo Ms,u) dsdu: 

J / N r N(ds,du) 

(5) Ao{t,v) - ' ' ' ' 



Jo nS(^){s,p{u)) 
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2.2. Asymptotic results. We make use of the foUowing regularity condi- 
tions; not all of these conditions are required for the proof of each theorem, 
nor are they the minimum required set of conditions. 

Condition A. 

(A.l) P{v) has componentwise continuous second derivatives on [0,1]. The 
second partial derivative of Xo{t, v) with respect to v exists and is con- 
tinuous on [0,r] X [0, 1]. The covariate process Z[t) has paths that are 
left-continuous and of bounded variation, and satisfies the moment 
condition ^[||Z(t)||'^exp(2M||Z(t)||)] < cx), where M is a constant 
such that {v,l3{v)) £ [0,1] x {—M,MY for all u and ||^|| =maxfc^/|afc/| 
for a matrix A = {aki). 

(A. 2) For j = 0,1,2, each component of s^^\t,9) is continuous on [0,r] x 
[-M,M]P, and sup,g[o,r],eg[-M,M].||5(^'H*,^)-s(^'ni,^)ll=Op(n-V2). 

(A.3) (t, e)>0 on [0, r] x [-M, Mf and the matrix ^{v) = J{t, /3{v)) x 

Xo{t,v)s^'^\t, i3{v)) dt is positive definite. 
(A.4) E{Ni{dt,dv)\Tt^) = E{N,{dt,dv)\Yi{t),Zi{t)), where = a{I{Xi < s, 

Si = l),I{Xi<s,6i = 0),ViI{Xi<s,di = l),Zi{sy,0<s<t,i = l,...,n} 

is the (right-continuous) filtration generated by {Ni{s, v),Yi{s), Zi{s);0 < 

s <t,0 <v < l,i = l,...,n}. 
(A. 5) The kernel function K{-) is symmetric with support [—1,1] and of 

bounded variation. The bandwidth satisfies nh^ oo and nh^ — > as 

n oo. 



Note that the condition (A. 2) holds under the condition (A.l) given some 
additional moment conditions on Z{t) — Z{s) and exp(6-'"Z(t)) — exp(6-^Z(s)). 
If Z{t) = Z^ not depending on t, then (A. 2) holds by the Donsker theorem 
(Theorem 19.5 of van der Vaart [19]). The condition (A.4) assumes that 
the mark-specific instantaneous failure rate at time t given the observed in- 
formation up to time t only depends on the failure status and the current 
covariate value. Under (A.4) and by the definition (1), E{Ni{dt,dv)\!Ft-) = 
Yi{t)\{t,v\Zi{t))dtdv, andMj(t,v) = j^[Ni{ds,dx)-Yi{s)\{s,x\Zi{s))dsdx\ 
is a martingale with respect to !Ft for each fixed v ([10], page 31). Further, 
it follows by Aalan and Johansen [1] that Mi{-,vi) and Mj(-,f2) — Afj(-,vi) 
are orthogonal square integrable martingales with respect to J-t for any < 
"Wi ^ "^2 !^ 1 • To avoid the problems at the boundaries f = 0, 1, we shall study 

the asymptotic properties of (3{v) for the interior values of S [a, h] C (0, 1). 

First we present the following result that is essential for proving the 
asymptotic normality of (3{v) and provides insight into the constructions 
of the confidence bands and test statistics that follow. Let 

s«(t,/3(n))- 



(6) WA{v)=n-^/^Y. r r^(^) 

~; J a Jo 



Z^{t) 



sW{t,P{u)) 



Mi{dt,du), 
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where A{u) is a deterministic p x p matrix with bounded components and 
0<a<&<l. 

Theorem 1. Assume that each component of the p x p matrix A{v), 
v£ [a,b], is continuous. Under conditions (A.1)-(A.4), Wa{v) converges 
weakly to a p- dimensional mean-zero Gaussian martingale, Wa{v), with con- 
tinuous sample paths on v £ [a,b]. The covariance matrix ofWA{v) is given 
by Cov{Wa{v)) = A{u)E{u)A{u)du. 

Let 

(7) t^{v)=n-^J2 r r Mu)JnitJiu))A^iu)N,idt,du), 

where A{v) is a consistent estimator of A{v) uniformly in v E [a,b] C [0, 1]. 
It can be shown that Tia{v) is a consistent estimator of Cov{Wa{v)). 

The consistency and asymptotic normahty of P{v) are estabhshed in the 
next two theorems. 

Theorem 2. Under conditions (A.1)-(A.5), P{v) converges to (3{v) uni- 
formly in V £ [a, b] C (0, 1). 

Theorem 3. Under conditions (A.1)-(A.5), {nh)^/'^0{v) - f5{v)) 
N{0,i^oJ:-Hv)) forvG[a,b]. 

The proof of Theorem 3 uses a Taylor expansion of the score function, 
leading to p{v)-/3{v) = -{l'^{v,p*{v)))~^U{(3{v)), where p*(v) is on the line 

segment between P{v) and P{v)- The asymptotic variance oi n~^^'^h^^'^U{P{v)) 
is shown to be foS(f ), which is the in probability limit of )) = n~^h x 

J27=i lo Ioi^h{u — v))'^Jn{t, /3{v))Ni{dt, du). It can also be shown that T,{v) = 
—lg{v,P{v))/n — ^ as n oo. Thus, the asymptotic variance of (nh)^^'^ x 
0{v) - Piv)) can be estimated by ±i{v) = {l'/^{vj{v))/n)-^tnij3{v)){l'/i{v, 
f3{v))/n)~^ . An alternative estimator is S2(w) = —i^oil p{v , f3{v)) /n)~^ . It is 
easy to check that uq = 3/5 for Epanechnikov's kernel K{x) = |(1 — x'^), 
— 1 < X < 1. Simulations indicate that the two estimators have similar finite 
sample performance. 

Theorem 3 will lead to the construction of pointwise confidence intervals 
for VE(u). Simultaneous inference over v S [a,b] will be possible in terms 
of the estimate B{v) = f^ (3{u)du of the cumulative regression coefficient 
B{v) = f^ (3{u)du. We have the following weak convergence result for B{v). 
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Theorem 4. Under conditions (A.1)-(A.5), n^/'^{B{v) — B{v)) con- 
verges weakly to a p- dimensional mean-zero Gaussian martingale W-£-i{v) 
with continuous sample paths onv £ [a, b] . The covariance matrix ofW^-i (v) 
is T,{u)^^ du, which can be consistently estimated by defined by (7) 

with A{v) = and A{v) = {t{v))-^. 

2.3. Confidence bands for vaccine efficacy. Let f3{v) = (/?i(f),/3j(t)))'^. 
Then the vaccine efficacy can be expressed as VE(t; ) = 1 — exp(/3i(z;)). The 
estimated vaccine efficacy is VE(u) = 1 — exp(/3i(t;)). By Theorem 3 and 

the delta method, {nhy/^{YE{v)-YE{v)) ^ N{0,uoal{v)e^p{2f3i{v))) for 
V S [a,b], where crfiv) is the first element on the diagonal of T,~^[v). Let 
(5"^j(i)) be the first element on the diagonal of Tii{v). By the discussions 
on the consistent estimators for the asymptotic variance following Theorem 
3, is a consistent estimator for UQaKv). A pointwise 100(1 — a)% 

confidence band for V"E{v) is given by 

(8) \Eiv)±{nh)-'/^z^/2a/3,{v)exp0i{v)), a<v<b, 

where is the upper a/2 quantile of the standard normal distribution. 

To derive simultaneous confidence bands for the cumulative vaccine effi- 
cacy CV(f ) = VE(n) du, we consider the point estimator CV{v) = 
J^YE{u)du. Then 

V^(CV(f) - CY{v)) = r {exp{Pi{v)) - exp0i{v))) du. 

J a 

Note that ^/n{CV{v) - CY{v)) « ^ exp{pi{v){/3i{v) - Pi{v))du. FVom 
the proof of Theorem 4, it can be shown that ^/n{CY{v) — CY{v)) converges 
weakly to a mean-zero Gaussian process, eJWA{v), a<v<b, with continu- 
ous paths and independent increments, where A{v) = exp(/3i(7;))S(i;)~^ and 
ei is the first column of the p x p identity matrix. The variance of eJWA{v) 
equals p'^{v) = erf («) exp(2/3i(u)) dn by Theorem 1, which can be conve- 
niently estimated by (u) exp(2/3i(n)) du, where aKv) is the first element 
of the diagonal of T,{v)~^ . We suspect that this estimator may ignore the 
finite sample correlations of Pi{v) — Pi{v) at different values of v, thus over- 
or underestimating the true variance. We propose to use /5^(f ) = e'fT,^{v)ei 
as the estimator of the asymptotic variance of ^/n{CY{v) — CV(f )), where 
is obtained from (7) with A{v) = exp{Pi{v))T,{v)~^ , which is uni- 
formly consistent by Theorem 1. Consequently, a pointwise 100(1 — a)% 
confidence band for CY{v) is given by 



(9) 
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Let V be a set of values of v in [a, b] . We may take V = [a, 6] or V = {vk,k = 
1,...,K} with vi < ■ ■ ■ < vk- Note that if U{v) is a Gaussian martingale 
with variance p'^{v), for a <v <b, then U {v) p{b)[p'^ (b) + p'^{v)]~^ has the 
same distribution as {p^ (v) / {p'^ {b) + p'^{v))), a < v < b, where B^{-) is 
a Brownian bridge. By the weak convergence of ^/n{C'V{v) — CV('y)), the 
uniform consistency of p'^{v) to p'^{v) and the continuous mapping theorem, 
we have 

supl V^(CV(^;) - CV{v))p{b)/{p\b) + p\v))\ 

^snp\B°{pHv)/{p\b) + p\vm. 
vev 

Thus a simultaneous 100(1 — a)% confidence band for CV{v), v £V, is given 
by 

(10) CV(^) ± n-y^u^[p\b) + p\v)]/p{b), 

where Ua is the upper a-quantile of the distribution of sup^^y\B^{p'^{v)/ 
{p^{b) + p'^{v)))\. The Ua is the upper a-quantile of supo<„<i/2 

\B^{v)\ if 

V = [a, 6], which has been tabulated by Schumacher [15] for some a values. 
In the simulation study presented in the next section, we estimate Ua by the 
upper a-quantile of the distribution of sup^^gy \ B^ {p^ (vk) / {p^ (b) + /5^(ffc)))| 
in both cases when V = [a, b] or V = {vk, k = 1, . . . , K}, which can be ob- 
tained by simulating a Brownian bridge for given p^{v). 

Alternatively, other resampling techniques such as the Gaussian multiplier 
method of Lin, Wei and Ying [9] can be used to estimate the critical value 
Ua- This method can be briefly outlined as follows. Let ^i,...,^^ be i.i.d. 
standard normal random variables and 

5W(t,/3(«))" 



(11) Wl{v) = n~'l^j2iir r A{u) 

■ -I "^0 



Ziit) 



5W(t, /?(«)) 



Mi{dt,du). 



Then the distribution ^/n{CV{v) — CV{v)) can be approximated by the con- 
ditional distribution of efW^lv) given the observed data sequence, where 

A = exp(/3i(t')) X (S(ti))~^. Consequently, the distribution of sup^gy \ y/n{CY{v) - 
CY {v))p{b)[p'^{b) + p'^{v)]~^\ can be approximated by the conditional distri- 
bution of U* = sup^.gy |ef W^(f )/>(5)[p^(6) + /0^(f)]~^| given the observed 
data sequence. Let u* be the (1 — a)-quantile of the copies of U* obtained 
by repeatedly generating sets of i.i.d. standard normal random variables. A 
simultaneous 100(1 — a)% confidence band for CV(v), v £V, is given by 

(12) CV{v)±n-'/'u*^[p\b)+p\v)]/m- 

This resampling technique is also applicable to the hypothesis tests for vac- 
cine efficacy developed in the next subsection. 
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2.4. Testing vaccine efficacy. We are interested in testing the following 
two sets of hypotheses. The first set of hypotheses is 

Hio -.YEiv) = ioi ve[a,b] 

versus Hia ■ VE('y) ^ for some v (general alternative) 

or Him. : VE(i)) > with strict inequality for at least some v 

(monotone alternative) . 

The second set of hypotheses is 

H20 : VE(t;) does not depend on u G [a, b] 

versus H2a ■ VE(?;) depends on v (general alternative) 

or H2m-^^iv) decreases as v increases (monotone alternative). 

Let Pi{v) be the first component of P{v). Then the null hypothesis Hiq is 
equivalent to Pi{v) = and the null hypothesis H20 is equivalent to (3i{v) 
does not depend on v. The null hypothesis Hiq implies the vaccine affords 
no protection against any infecting strain of virus. The alternative Him in- 
dicates that the vaccine provides protection for at least some of the infecting 
strains, while Hia states that the vaccine provides either protection or in- 
creased risk for some infecting strains. The null hypothesis H20 implies there 
is no difference in vaccine effect for different infecting strains, measured by 
their distance v to the strains contained in the vaccine. The ordered alter- 
native H2m states that vaccine efficacy decreases with v and the alternative 
H2a indicates that the vaccine efficacy changes with v. 

In this section, we develop some test procedures for detecting departures 
from HiQ in the direction of Him and Hia and for detecting departures 
from H20 in the direction of H2m and H2a- By Theorem 4 and the discus- 
sions in Section 2.3, the process ^/n{C'V{v) — CY{v)), a <v <b, converges 
weakly to a Gaussian martingale with predictable variation p'^{v). Let S,{v) = 

y/n{CV{v) - CV{v))/p{b). It follows from Theorem 4 that C{v) W{t{v)), 
a<v<b, where W{-) is a Wiener process and t{v) = p^{v)/p^ib). 

To test Hio, let Z^^\v) = ^CV{v)/p{b) and i{v) = p^{v)/p^{b). Consider 
the following test statistics: 

Ti^) = f\z^'\v)fdi{v), T^^l = f'z('\v)di{v). 

J a J a 

These test statistics have somewhat complicated null distributions (see be- 
low) so we also consider the following simpler test statistic based on a finite 
grid, which leads to a standard normal null distribution: 

Tj^l = {K- ir'I^Y.{Z^'\vu) - zW(vk^i))/{i{vk) - i{vk-i)f\ 

k=2 
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where a < vi < ■ ■ ■ < vk < b are the grid points in [a, b]. A similar test statis- 
tic with a standard normal null distribution is also proposed for H20 later. 

Under H^o, tP ^ j'jW{t{vW dt{v) ^ J,\W{t)rdt, r« ^ 
Jl^ W{t{v)) dt{v) = Jq W{t) dt and T^i iV(0, 1). The distributions of t]^^ 
and r^\^ under Hio can also be approximated by those of {W{i{v)))'^ di{v) 
and W {i{v)) di{v) for given i{v), respectively, which are used in the nu- 
merical studies for better finite sample approximations. We denote the upper 
a-quantiles of these two distributions by c^^ and c^{, respectively. 

The test statistic Ta^^ captures general departures Hia, while the test 
statistics T^l and T^l sensitive to the monotone departure Him- Both 
test statistics T^l and T^l are likely to be positive when VE(t;) > for all 
V with strict inequality for some v. Hence the tests based on Ta^\ T^l and 
T^2 reject Hiq if ri^-* > ci\ T^H > c^^ and T^l > ^a, respectively. 

To test let Z^^\v) = ^Mi^^i^) - br^CV(6))/p(6). Note that, 

under H^q, Z^'\v) = ^[^JCV{v) - CY{v)) - ^(CV(fe) - CV(6))]//)(6). 

By Theorem 4 and the continuous mapping theorem, under H20, Z^'^\v) 
:^W{t{v)) - ^W{1) = Z^^\v) for v e [ai,b], where a < ai < 6. We pro- 
pose the following test statistics for evaluating H20: 

\zi^\v)fdi{v), T^2=f 



TP = I {Z('\v)fdi{v), ) = / Z('Kv)di{v) 



k=2 

where ai < u 1 < • • • < vk < b are K grid points in [ai, 6], 7r| is an estimate of 
the variance -7r| = Yai{Z^'^\vi^_i) — Z^'^^Vj^)) and is an estimate of the 

variance 11^ of J2k=2i^^'^\''^k-i) — Z^'^\vk))/TTk- By the covariance of the 
Wiener process, it is easy to show that 



Ti 



Cov{Z^'\vi),z(^\vj)) 



tjVi) t{Vi) t{Vj) _^ 



{vi — a){vj—a) {vi — a){b — a) {vj — a){b — a) ' {b — ay 
for Vi < Vj. Thus, 7r| = Tk^i^k~i - 2rfc_i,fc + Tk,k- Let T = {Tij)KxK and 



It follows that Hk = C"^rC- The estimates vr^ and are obtained by re- 
placing t(v) with t{v). 

By the weak convergence of Z^'^\v) to Z^'^\v), and the convergence in 

(2) D 

probability of t{v) to t{v), ai < v < b, we have — ^ -^(0, 1) under H20. It 
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also follows that tP ^ jl^^{Z(^\v)f dt{v), and T^^l ^ jj;^ Z^^^v) dt{v) 

(2) (2) 

under H2o- The distributions of Ta and under H20 can be approxi- 
mated by those of j!^^{W{i{v))/{v - a) - W{i{b))/{b - a)^ di{v) and 

Ja^{W{t{v))/{v - a) - W{t{b))/{b - a))di{v) for given i{v), respectively, 

which are used in the numerical studies for better finite sample approxi- 

(2) 

mations. We denote the upper a-quantiles of these two distributions by Ca 

(2) 

and c^l, respectively. 

(2) 

The test statistic Ta captures general departures H2a while the test 

statistics T^^l and T^l are sensitive to the monotone departure H2m- Both 

(2) (2) 

T^^( and ^re expected to be positive when YE{v) decreases as v in- 
creases, that is, when H2m holds. Hence the tests Ta'^\ T^^l and T^^l reject 
H20 if Ta'^^ > ci\ T^i > c^i and T^l > ^a, respectively. 

3. Simulation study. In this section, we conduct a simulation study to 
check the finite sample performance of the proposed estimation and hypoth- 
esis testing procedures using the simple mark-specific proportional hazards 
model: 

(13) X{t,v\z) = exp{-fv + {a + Pv)z}, t>0,0<v<l, 

where a, P and 7 are constants and the treatment indicator z takes value 
or 1 with probability of 0.5 for each value. Under model (13), the mark- 
specific baseline function is Xo{t, v) = exp(7t') and VE(t;) = 1 — exp(a + f3v). 
The null hypothesis Hiq of no vaccine efficacy holds if both a = and (3 = 0, 
and the null hypothesis H2Q that vaccine efficacy does not depend on the 
type of infecting strain is true if /3 = 0. Various choices of a and (3 specify 
different alternatives for Hiq and H2o. 

We consider the following simulation models: 

(Ml) (a, /3, 7) = (0, 0, 0.3) , for the null hypothesis Hiq of no vaccine efficacy; 

(M2) (a,/3,7) = (—0.5,0.5,0.3), as the first alternative of Hiq; 

(M3) (a,/?, 7) = (—0.6,0.6,0.3), as the second alternative of Hiq; 

(M4) (q;,/3,7) = (—0.6,0,0.3), as the third alternative of Hiq; 

(M5) (a,/9,7) = (—0.69,0,0.3), for the null hypothesis H20 that vaccine ef- 
ficacy does not depend on the type of infecting strain; 

(M6) (a,/9,7) = (—1.2,1.2,0.3), as the first alternative of H2Q; 

(M7) (a,/9,7) = (—1.5,1.5,0.3), as the second alternative of H2Q; 

(M8) (a,/3,7) = (-1.8,1.8,0.3), as the third alternative of H2Q. 

The models (M2) to (M4) are considered as the alternatives for Him and 
Hia- The departure from Hiq : VE(f) = increases as the simulation model 
moves from (M2) to (M4). The models (M6) to (M8) are considered as the 
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alternatives for and H2a- The departure from H2Q increases as the 

simulation model moves from (M6) to (M8). 

We generate the censoring times from an exponential distribution, inde- 
pendent of (T, V), with the censoring rates ranging from 20% to 30%. We set 
the interval of analyses for v as [a, h] = [0.1, 0.9] and bandwidths are chosen as 
h = 0.05, 0.1, 0.15. The observed failure times with marks outside the interval 
[a, h] can also be used since the smoothing at v takes the cases with marks in 
its /i-neighborhood. The Epanechnikov kernel K{x) = 0.75(1 — < 1} 

is used throughout. Sample sizes of n = 500 and 800 are studied. 

For the tests t'^^ and , we take the grid of eight evenly spaced points 
in [a, 6] from 0.196 to 0.868. Table 1 lists the empirical sizes and powers of 
the test statistics xjp , t!^\ and T^^^ and Table 2 for the test statistics ri^^ , 

and T^2- The significance levels of these tests are given at a = 0.05. 
Both tables also list the coverage probabilities of the 95% simultaneous 
confidence intervals for CV{v), for v G [a, b] and for v in the grid. The critical 

values for the tests T^l and T^^l at nominal level 0.05 1.645. The 

critical values for the tests Ta^^ and Ta^\ T^^l and T^l are obtained by 
generating 10,000 Wiener processes W{-) with time parameter equal to t{v) 
and calculating the corresponding functionals of W{t{v)), as described in the 
previous section. Each entry in Tables 1 and 2 is based on 1000 repetitions. 

Most tests have appropriate sizes close to 5%. The test Ta seems to be 
conservative for the simulation models used in the study. The test T^l has 

better power than the tests Ti"^^ and z£^2- The test T^^l has better power 

(2) (2) — 

than the tests Ta and Z^2 . Therefore the tests that incorporate CV(v) over 

the entire range [a, b] present greater power than the simpler tests based on 

CY{v) over the grid. We also observed that the powers of the tests seem to 

be influenced by the selection of bandwidth, with greater power for a larger 

bandwidth. Similar plots (not included here) to Figure 1 and Figure 2 but 

with larger bandwidth h = 0.2 show that the estimated standard errors of 

CV(u) become smaller for larger h while the biases stay approximately the 

same, resulting in increased power for the larger bandwidth. We suspect that 

this phenomenon is associated with the sample size and the convergence rate 

of the normalized CV(f ) to a Wiener process. The dependence of the power 

on the bandwidth should become small as the sample size increases. Further 

study on the bandwidth selection is warranted. 

The coverage probabilities of the simultaneous confidence intervals for 

CY{v) are closer to the 95% nominal level for v on the grid than on [a, b] . 

This may be explained by the fact that the convergence for v over the entire 

range [a, b] is slower than the convergence on the grid. The evaluations of 

the proposed estimators for (3{v), VE(u) and CY{v) and their respective 

estimators of the standard deviations under some of the simulation models 
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Table 1 

Empirical sizes and powers of the tests Tji''\ T^^l and T^^ ^he nominal level 0.05, and 
coverage probabilities of the 95% simultaneous confidence intervals for CV(v) with v on 

the grid and on [a, b] 













Size/Power 




Coverage 


Model 


(a, 0. 


n 


h 




yd) 

ml 




Grid 


[a,b] 


Ml 


(0,0,0.3) 


500 


0.05 


2.9 


3.1 


7.8 


Q7 ^ 
y 1 .0 










0.1 


4.9 


5.9 


8.3 


yD.o 


Q7 A 

y / 








0.15 


5.1 


6.9 


7.3 


yD.z 


yo.o 






800 


0.05 


5.3 


2.8 


6.9 


Q^i Q 

yo.y 


yo.o 








0.1 


5.7 


4.7 


6.8 


95.5 


97.0 








0.15 


5.8 


5.2 


6.3 


95.6 


96.5 


A/TO / 


n cc AC n Q^ 
-U.O, U.O, U.o ) 


OUU 


U.Uo 


4o.4 


OD.O 


DO.Z 


97.6 


98.0 








0.1 


60.3 


71.4 


65.7 


97.0 


97.5 








0.15 


66.0 


77.4 


65.5 


96.7 


97.6 






800 


0.05 


69.1 


78.4 


77.5 


96.1 


96.8 








0.1 


80.3 


86.5 


80.1 


95.6 


96.7 








0.15 


82.9 


89.1 


80.1 


96.0 


97.2 


M3 (- 


-0.6,0.6,0.3) 


500 


0.05 


59.7 


70.0 


76.5 


97.5 


98.0 








0.1 


75.4 


83.9 


78.8 


96.9 


97.8 








0.15 


80.9 


87.2 


78.5 


96.9 


97.9 






800 


0.05 


83.7 


90.4 


87.6 


96.2 


96.9 








0.1 


90.8 


94.4 


89.6 


96.0 


96.8 








0.15 


93.0 


96.0 


89.6 


96.2 


97.2 


M4 (- 


-0.6,0,0.3) 


500 


0.05 


96.0 


95.6 


99.9 


97.0 


97.8 








0.1 


99.1 


98.8 


100 


96.7 


97.6 








0.15 


99.5 


99.7 


100 


96.7 


97.4 






800 


0.05 


99.9 


99.5 


100 


97.0 


98.0 








0.1 


100 


100 


100 


96.9 


97.3 








0.15 


100 


100 


100 


96.4 


97.4 



are presented in Figure 1 and Figure 2. The plots of the pointwise coverage 
probabihties for V"E{v) and for CV(f ) are given in Figure 3. These plots are 
based on n = 500 and h = 0.1. 

Now we demonstrate with a simulation example that the adoption of a 
standard method for testing the vaccine efficacy that ignores the mark is 
inefficient and can be misleading. We consider a special case of the model 
discussed in the Introduction, with X{t,v\z = 0) = 1 and X{t,v\z = 1) = 2v, 
for t>0, 0<f<l. The covariate z is again a treatment indicator taking 
values and 1 with probability of 0.5 for each value. The marginal hazards 
model ignoring the mark is therefore X(t\z = 0) = 1 and X{t\z = 1) = 1, for 
t > 0. The rest of the simulation setup such as the percentage of censor- 
ship, the kernel function and the bandwidth is the same as for the previous 
models. The model considered here represents both a proportional mark- 




Fig. 1. Plots of estimates for l3{v), VE{v) and CV(ii) under the models Ml, M2, M5 and M6 for n = 500, /i = 0.1. The solid dark lines 
are the true functions and the dashed lines are the averages of the estimates based on 1000 repetitions. The gray lines are the corresponding 
estimates for P{v), VE(w) and CV(u) of 50 random samples. 




05 




model M1 model U2 model M6 model M6 



Fig. 2. Plots of the standard errors under the models Ml, M2, M5 and M6, based on n = 500, fe = 0.1. The solid lines are the averages 
of the estimates of the standard deviations of j3{v), VE(i;) and CV(i'), while the dashed lines are the sample standard deviations of I3{v), 
VE(t;) and CV{v), based on 1000 repetitions. The gray lines are the corresponding estimates for the standard deviations of P{v), VE(i;) 
and CY{v) of 50 random samples. 
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Table 2 

Empirical sizes and powers of the tests Ta , T!^{ and ^he nominal level 0.05, and 

coverage probabilities of the 95% simultaneous confidence intervals for CV(v) with v on 

the grid and on [a, b] 



Size/Power Coverage 



Model 


(q:,/3,7) 


n 


h 


7^(2) 

a 


7^(2) 
-'ml 


7^(2) 
-'m2 


Grid 


[a,b] 


M5 (- 


-0.69,0,0.3) 


500 


0.05 


1.6 


3.7 


3.7 


97.0 


97.8 








0.1 


2.1 


3.7 


4.5 


96.5 


97.5 








0.15 


2.1 


3.5 


4.6 


96.8 


97.3 






800 


0.05 


2.3 


4.0 


2.9 


97.3 


98.3 








0.1 


2.6 


4.3 


3.2 


97.0 


97.6 








0.15 


2.1 


3.5 


3.0 


96.9 


97.4 


M6 (- 


-1.2,1.2,0.3) 


500 


0.05 


47.2 


67.6 


47.7 


97.9 


98.5 








0.1 


60.2 


76.7 


62.3 


97.1 


97.6 








0.15 


63.2 


80.3 


73.3 


97.5 


97.8 






800 


0.05 


69.2 


85.1 


69.2 


96.5 


97.2 








0.1 


80.4 


92.0 


80.4 


96.6 


97.6 








0.15 


84.2 


94.1 


88.4 


96.9 


97.8 


M7 (- 


-1.5,1.5,0.3) 


500 


0.05 


63.8 


81.4 


62.1 


97.7 


98.0 








0.1 


76.9 


78.0 


63.6 


97.2 


98.0 








0.15 


81.2 


91.7 


86.3 


97.6 


98.0 






800 


0.05 


85.1 


94.4 


82.6 


96.2 


97.1 








0.1 


93.2 


98.2 


91.8 


96.1 


97.6 








0.15 


96.0 


98.9 


97.4 


96.7 


97.7 


M8 (- 


-1.8,1.8,0.3) 


500 


0.05 


77.6 


89.1 


73.6 


97.8 


98.5 








0.1 


87.1 


95.6 


85.7 


97.3 


98.4 








0.15 


91.5 


96.9 


92.8 


97.7 


98.7 






800 


0.05 


93.5 


98.2 


91.4 


96.4 


97.4 








0.1 


98.2 


99.5 


97.0 


96.3 


97.5 








0.15 


99.3 


99.9 


99.2 


96.5 


97.9 



specific hazards model for \{t,v\z) and a proportional hazards model for 
A(t|z) = Ao(t) exp(/3z), with the mark-specific vaccine efficacy VE('y) = l — 2v 
and the marginal VE = 1 — exp(/3) = 0. The standard Wald test, denoted by 
T^, under the marginal Cox model is often used to test for the vaccine effi- 
cacy. As expected, the standard Wald test shows no power (Table 3). It is 
incapable of revealing any vaccine efficacy or that the vaccine efficacy de- 
pends on the mark, thus missing the important scientific finding that the 
vaccine protects against viruses with smaller mark values {V < 0.5) and 
increases risk of infection with viruses with larger mark values {V > 0.5). 
The example we constructed here shows the weakness of using the standard 
approach that ignores the mark and is what motivates the present research. 

4. Application. The first preventive HIV vaccine efficacy trial was car- 
ried out in North America and The Netherlands, and enrolled 5403 HIV- 
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V 

model M1 



70 -J 



0.2 0.4 0.6 0.8 1 



model M5 




70 



0.2 0.4 0,6 0.8 1 



model M2 



70 



— I 1 1 1 1 

0,2 0.4 0.6 0.8 1 



model M6 




70- 



0.2 0.4 0.6 0.8 1 



V 

model M3 



70 -J 



0.2 0.4 0.6 0.8 1 



V 

mode) M7 



Fig. 3. Plots of the pointwise coverage probabilities for VE{v) (gray lines) andforCV{v) 
(solid lines), based on n = 500, h = 0.1 and 1000 repetitions. The models on the left panel 
are Ml, M2 and M3. The models on the right panel are M5, M6 and Ml. 



negative volunteers at risk for acquiring HIV infection [4]. Volunteers were 
randomized in a 2:1 ratio to receive a recombinant glycoprotein 120 vac- 
cine (AIDSVAX) or placebo, and were monitored for HIV infection at semi- 
annual HIV testing visits for 36 months. The primary objective was to assess 
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Table 3 

Comparison of the standard Wald test with the proposed tests Ti^\ Tj^l, T^l, ri^', 

(2) 

and ^he nominal level 0.05 



Power 



n 


h 


T 


-*- a 


-'ml 




rr(2) 
-*- a 


T^(2) 
-'ml 


rr(2) 
-'■m2 


500 


0.05 


5.9 


14.9 


24.2 


16.6 


98.0 


99.4 


97.3 




0.1 




23.9 


35.7 


16.0 


99.6 


100 


99.8 




0.15 




27.9 


39.1 


15.7 


99.9 


100 


99.9 


800 


0.05 


6.1 


32.4 


39.6 


15.0 


100 


100 


99.6 




0.1 




43.1 


51.5 


13.8 


100 


100 


100 




0.15 




46.0 


53.3 


13.9 


100 


100 


100 



VE using the standard Cox model, and a secondary objective was to test 
i!fio : VE(t, f ) =0 and H20 : VE(t,t') = VE(t) for three different mark vari- 
ables V defined in terms of the percent mismatch of aligned amino acid 
sequences (for each infecting HIV sequence compared to the HIV sequence 
[named GNE8] contained in the AIDSVAX construct) in three subregions of 
HIV-gpl20. For brevity, in this article we consider only one mark V, defined 
as the percent mismatch of amino acids in the whole gpl20 region (581 amino 
acids long), where all possible mismatches of particular pairs of amino acids 
(e.g., A versus C) are weighted by the estimated probability of interchange 
[12]. The distance is based on the gpl20 region because this region con- 
tains neutralizing epitopes that potentially can induce anti-HIV antibody 
responses that prevent HIV infection [21]; the vaccine was designed to pro- 
tect by stimulating high titer antibodies that neutralize exposing HIVs. Of 
the 368 individuals infected during the trial, 32 had missing marks. Of the 
remaining 336 samples, all marks were unique (217 vaccine; 119 placebo). 

The vaccine efficacy is estimated and tested by adjusting for two covari- 
ates: age (ranging 18-62 years with mean of 36.5) and behavioral risk score 
(taking values 0-7) as defined in [4]. It is relevant to adjust for these co- 
variates because they predict infection rate and because trial participants 
with different values of these covariates may be exposed to HIV strains with 
different distributions of V. Both covariates are considered as continuous 
variables. The histograms of the rescaled mark values, ages in years and 
behavioral risk scores are plotted in Figure 4. We denote the treatment in- 
dicator by zi {zi = 1 for the vaccine and zi = for the placebo), age by 
Z2 and behavioral risk score by Z3, and denote the corresponding coefficient 
functions by f3i{v), (32{v) and Ps^v). Fitting model (2) with /i = 0.3, the 
plots of the estimates for Pi{v), P2{v) and Psiv) and their pointwise confi- 
dence bands are given in Figure 5. The plots of VE(i;) and CV(u) with their 
corresponding pointwise confidence bands adjusting for the two covariates 
Z2 and Z3 are given in Figure 6. 
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Adjusting for age and behavioral risk score, the Wald test statistic for test- 
ing the marginal VE = using the standard Cox model is —0.978, yielding 
a p- value of 0.328 for the two-sided alternative and 0.164 for the monotone 
alternative. Our test with the test statistic Ta^^ for Hiq : VE(u) = for all v 
versus the general alternative Hia yields a p- value of 0.1532. The p- values 
for testing against the monotone alternative Him 3-1^6 0.0916 for T^l and 

0.0228 for r^\\ These results give some, albeit weak, evidence of nonzero 

vaccine efficacy for at least one mark value; see Figure 6. 

In addition, adjusting for age and behavioral risk score, we conducted 

the tests to evaluate whether the vaccine efficacy varies with the mark. The 

p-value for testing H20 that VE(u) does not depend on v versus the general 

(2) 

alternative i?2a is 0.2067 for the test statistic Ta ■ The p- value for testing 

for the monotone alternative H2m is 0.9363 for the test statistic and 

(2) 

0.9047 for the test statistic T^2 - These p- values are expected given the plots 
in Figure 6 where VE(u) shows some tendency to increase with v. 

5. Discussion. This article developed inference techniques for the pro- 
portional hazards model with a continuous mark variable, including non- 
parametric methods for estimation and testing of mark-specific regression 
functions. These techniques can be used to estimate mark-specific vaccine 
efficacy (VE(?;)) and cumulative mark-specific vaccine efficacy (CV(f )) with 
simultaneous confidence bands, and to test hypotheses for YE{v), while ad- 
justing for time-dependent covariate effects. The testing procedures based 
on the statistics T^l and T^l showed greatest power in simulations and are 
recommended for testing VE(t>) = for all v and for testing VE(u) indepen- 
dent of V, respectively. 

An alternative approach to the continuous mark-specific PH model would 
be a similar model that treats the mark variable as ordinal categorical. We 
focused on a continuous mark because (i) it most naturally suits the HIV 
vaccine application, as the choice of K bins for categorizing the marks would 
be arbitrary and (ii) testing P(v) = (3 can often be done with greater power 
than testing equality of the cause-specific regression coefficients /3i = • • • = 
[3k. 

As is well known for a discrete mark-specific hazard function, the interpre- 
tation of the continuous mark-specific hazard function X{t,v) is restricted to 
actual study conditions, that is, it is the instantaneous rate of failure in the 
presence of all of the circulating competing risks (i.e., is a "crude" hazard 
in the terminology of Prentice et al. [14]). However, often the main scientific 
interest is in the "net" mark-specific hazard, the instantaneous rate of failure 
by mark v in the absence of any other competing risks, but unfortunately 
this parameter is not identified except under untestable assumptions such 
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Fig. 4. Histograms for the observed mark values, ages in years and behavioral risk scores. The left panel is for the vaccine group and 
the right panel is for the placebo group. 



22 



Y. SUN, P. B. GILBERT AND I. W. MCKEAGUE 





> 



-0.2-1 

o.e- 

0.4 - 
0.2- 
0- 
-0.2 - 



0.2 



0,4 



0.6 



0.8 



V 

(b) 



0.2 



— I — 

0.4 



"oe" 



0.8 



V 

(c) 

Fig. 5. Plots of the estimated regression coefficients /?2(w) and /33(t') and their 

95% pointwise confidence bands for the vaccine trial data with ft = 0.3. 



as mutual independence of all of the notional (latent) mark-specific failure 
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Fig. 6. Plots of the estimates of YEf(v) and CV(d) and their confidence bands for the 
vaccine trial data with h — 0.3. The dashed lines are 95% pointwise confidence bands and 
the dotted lines are 95% simultaneous confidence bands. 



times [18]. This problem necessitates careful interpretation of inferences in 
the mark-specific PH model. 
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For the HIV vaccine trial example, the crude mark-specific hazard can be 
factored as 



where XE{t,v\z) is the intensity of exposure to strain v for participants with 
covariates z and Xpc{t\v, z) (the "per-contact" transmission hazard) is the 
same as X{t,v\z) except that it further conditions on the (unobserved) pres- 
ence of exposure to a virus with genetic distance v during [t,t + dt). Exposure 
can arise from unprotected sex or sharing a needle with an individual in- 
fected with strain v. Therefore the identified parameter measures a mixture 
of vaccine/placebo- group differences in mark-specific exposure rates and in 
conditional mark-specific per-exposure transmission probabilities, whereas 
biological interest is in 



as a measure of vaccine efficacy. However, as data are not available for esti- 
mating the relative intensity A£;(t,f 1 1,2:2 )/A£;(t,f 1 0,^2), our approach is to 
use 



as the target estimand, and assume identical exposure rates between the 
two groups, so this target has the same interpretation as VE^'-^(t|7;, 2:2). 
Reliance on this assumption demonstrates the value of including covariates 
Z2 that predict mark-specific exposure into the mark-specific PH model: the 
richer the covariate information the more likely VE(i, v\z2) reflects biological 
vaccine efficacy. Gilbert, McKeague and Sun [5] provided further discussion 
of the interpretation of mark-specific hazard ratios. 

The usefulness of our approach relies on the validity of the mark-specific 
proportional hazards model. Lin, Wei and Ying [9] developed goodness-of- 
fit tests for the standard Cox model based on martingale residuals, and 
their tests can be extended to the present setting by using the mark-specific 
martingale residuals 



(15) m{t,v)= [Niids,du)-Y^{s)eM0{u)fZi)Ao{ds,du)], 



for i = 1, . . . , n. These residuals may be interpreted as the difference at time 
t between the observed and the predicted number of events with mark less 
than V for the ith subject, and are informative about model misspecification. 
It can be checked that n-^/^J27=iM^{t ,v) = Op(l). This property is similar 
to that in the standard Cox model, where the sum of the martingale residuals 
is exactly zero. The difference here is caused by the kernel smoothing in a 



(14) 




YE^^{t\v,Z2) = l 




Xpc{t\v,0,Z2) 
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neighborhood of v. Because /3(f) is treated nonparametrically, the checking 
of the model (2) needs further development and has additional issues related 
to the bandwidth. This would need a thorough treatment that is beyond the 
scope of the present paper. 

Finally, we caution that the method proposed here requires large sample 
sizes to work well as demonstrated in the simulation study. This is the result 
of f3{v) being treated nonparametrically: the estimation of f3{v) utilizes only 
the observed failures with marks in a neighborhood of v. Although this does 
not cause a problem in our application to the first preventive HIV vaccine 
trial (which has a sample size of 5403), one needs to be careful in applying 
the method to situations with small sample sizes. 

APPENDIX 

The following lemma is an extension of Theorem 5.7 of van der Vaart [19] 
and will be used to prove the uniform consistency of P{v). 

Lemma A.l. Let Qn{v,6) be random functions and let Q{v,9) be a fixed 
function of {v,6) G [a,b] x 0, C W. Let f3{v) be a fixed function of v € 

[a, taking values in 0. Assume that swp^ g\Qn{v,9) — Q{v,9)\ — >0 and 
that for every e > there exists an r] > such that sup||g_^(^)||>£ (5(?;, 0) < 

Q{v,l3{v)) — r] for v G [a,b]. Then for any sequence of estimators (3{v), with 

Qn{v, Piv)) > Qn{v, P{v)) — Op(l) Uniformly in v ^ [o,^]; we have (3{v) — > 
f3{v) uniformly in v € [a,b]. 

Proof. For every e > 0, there exists an r/ > such that 
{sup mv) - P{v)\\ >e]c [jmv) - P{v)\\ > e} 

c\J{QivJ{v))<Q{v,(3{v))-ii}. 

V 

P 

Since Qn{v,f3{v)) > Qn{v,l3{v))-0p{l) — > Q{v,/3{v)), uniformly in v G [a,b], 
we have Qniv, ${v)) > Q{v,f5{v)) — Op{l), uniformly in t) G [a,b]. It follows 
that 

\J{Q{vJiv))<Qiv,(3iv))-7^} 

V 

C \J{Q{v, P{v)) < Qn{v, P{v)) -rj + Op(l)} 

= S^nfiQ{v,P{v))-Qn{vJ{v))) < -r/ + Op(l)| 
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= i^snpiQnivJiv)) - Q{vJ{v))) >rj- 

C !^snp\Qn{vJ(v)) - Q{vJ{v))\ > ?7- Op(l)|, 

whose probability goes to by the uniform convergence of Qn{v,0) to 
Q{v,d). Hence P{supJ|/3('t;) - > e} ^ 0. □ 

The following lemma is used to prove Theorems 3 and 4. Let A'^ = J27=i 
and M = J27=i Mi. 

Lemma A. 2. Under conditions (A.1)-(A.4), n~^N {t,v) ENi{t,v) , 
uniformly in {t,u) € [0,r] x [0,1], and n~^^'^M{t,v) converges weakly to a 
mean-zero continuous Gaussian random field G(t,v), {t,v) G [0,r] x [0,1], 
with independent increments and Mar {G{t,v)) = Jq Jq Xo{s,u)s^'^' {s, f3{u)) ds du. 

Proof. We treat cvi = {Xi,6i,Vi), i = 1, . . . ,n, as a random sample from 
a probability distribution P on a measurable space {X,A), with X = [0, oo) x 
{0, 1} X [0, 1] and A its Borel cr-field. Let T be the class of all indicator 
functions ft,v-^ — > R, where ft,v{uJi) = l{[0,t] x {1} x [0,v]){uJi) = I{Xi < t, 
di = l,Vi<v)JorO<t<T,0<v<l. Then n~^N{t,v) = n-^Y.i=i ftA^i)- 
Let ||/j,.||p,. = iP\ft,vn^^' be L,(P)-norm of ft,,. 

Let = to < ti < • • • < tx = T and = vq < vi < • • • < vj = 1 be parti- 
tions of the intervals [0, r] and [0,1]. Define the bracketing functions l/^j = 
Ni{tk-i,Vj_i) and Ukj = Ni{tk,Vj), for k = l,...,K, j = 1, . . . , J. Then for 
any ft^v G J^, there is a bracket [lkj,Ukj] such that ft^v G [lkj,Ukj]- And 

ll^^fcj -kj\\p,i < E{Ni{tk,Vj) - Ni{tk-i,Vj-i)) 

= r Xo{s,x)s'^°\s,f3{x))dsdx 
Jo Jo 

- r^^' Xo{s,x)s^^\s,p{x))dsdx 

Jo Jo 

< [\o{s,x)s^°\s,P{x))dsdx 
Jtk-i Jo 

+ r r Xo{s,x)s'^°\s,/3{x))dsdx 

Jo Jvj — i 

< Ci{tk - + C2{Vj - Vj^i), 

where Ci and C2 are some positive constants. For any e > 0, choose the grid 
points such that tk — tk-i < £ and Vj — Vj-i < e. Then Wu^j — lkj\\p,i < [Ci + 
C2]e. Hence, the bracketing number N\^.T^{e,!F,Li{P)) is of the polynomial 
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order By the Glivenko-Cantelli theorem (Theorem 19.4 of van der 

Vaart [19]), n~^N{t,v) ENi{t,v), uniformly in {t,v) G [0,r] x [0,1]. 

Next, consider the processes {Mi{t, v),0 < t < t,0 < v <1}, i = 1, . . . ,n, as 
a random sample from a probability distribution P on a measurable space 
{X,A). Let be the class of coordinate projections ft,v'-'^ — > where 
ft^^,{Mi) = Mi{t,v), for < i < r,0 < w < 1. The process {Mi{t,v),0< t < 
T,0 <v <1} is determined by the {Xi,6i,6iVi, Zi}. 

Again, let = to < ti < ■ ■ ■ < tx = t and = vq < vi < ■ ■ ■ < vj = 1 he the 
partitions of the intervals [0,r] and [0,1]. Define the bracketing functions 
kj = Ni{tk-i,Vj-i) - /o^/o'" Yi{s)X{s,x\Zi{s))dsdx and u^j = Ni{tk,Vj) - 
Jq''~^ Jq^~^ Yi{s)X{s,x\Zi{s)) ds dx, for k = l,...,K, j = l,...,J. Then for 
any ft^v G J^, there is a bracket [Ikj , Ukj] such that ft^v £ [hj , Ukj] ■ The bracket 
size is 



\\Ukj 



hj\\p,2 < \\Ni{tk,Vj) 
+ " ■ 



- Ni{tk-l,Vj^l)\\p^2 

Yi{s)X{s, x\Zi{s)) ds dx 



'"j-i 



Yi{s)X{s, x\Zi{s)) ds dx 



P,2 



< [Ci{tk - tk-i) + C2{vj - ^;J-l)]^/^ 



where Ci and C2 are some positive constants. For any e > 0, choose the 
grid points such that — tfe_i < e and Vj — Vj-i < e. Then \\ukj — lkj\\p,2 < 
[Ci + C2]^/^e^/^. Hence, the bracketing number N[.]{£^/^ ,J=', L2iP)) is of the 
polynomial order (1/e)^. Thus, N[.j{e,J-,L2{P)) is of the polynomial order 
(1/e)^. So the bracketing integral J[.]{1,J-,L2{P)) < 00. By the Donsker the- 
orem (Theorem 19.5 of van der Vaart [19]), n-^/^M = {n^^/^ ^"=1 Mi(t, v), < 
t < T,0 < V < 1} converges weakly to a mean-zero Gaussian process G{t,v), 
(t,v) G [0,r] X [0, 1], which can be constructed to have continuous paths by 
Theorem 18.14 and Lemma 18.15 of van der Vaart [19]. 

Now we show that G{t,v) has independent increments. Note that for 
ti < t2 and vi < V2, the covariance of G{ti,vi) and G{t2,V2) — G{ti,vi) 
is E{Mi{ti,vi) X {Mi(t2,V2) — Mi{ti,vi))}. By Aalan and Johansen [1], 
Mj(t,fi) and Mi{t,V2) — Mi{t,vi), <t < t, are orthogonal square inte- 
grable martingales for < vi < V2 < 1- It follows that 

E{Mi{tuVi){Mi{t2,V2) - Mi{h,vi))} 

= E{Mi{ti,vi){Mi{t2,V2) - Mi{t2,vi))] 

+ E{Mi ih ,vi){Mi{t2,vi) - Miituvi))} 

= 0. 
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Hence G{ti,vi) and G{t2,V2) — G{ti,vi) are independent. □ 

Proof of Theorem 1 . It is easy to check that the conditions of Lemma 
1 of Sun and Wu [17] are satisfied under Condition A. It follows that Wa{v) 
converges weakly to a vector of continuous mean-zero Gaussian random pro- 
cesses, Wa{v), V £ [a,b]. Now we show that Wa{v) has independent incre- 
ments. Let Wi{t,v) = mQA{u)[Zi{t) - s'-^\t,P{u))/s^'^\t,P{u))]Mi{dt,du). 
Then Wa{v) = n~^/^ J2i=i '^ii'^'i For a<vi<V2<b, the covariance ma- 
trix of Wa{vi) and Wa{v2) — Wa{vi) is equal to E{wi{T,vi){wi{T,V2) — 
Wi{'T,vi)Y'}. Since Mi(t, fi) and Mi{t,V2) — Mi{t,vi)^ 0<t<T, are orthog- 
onal square integrable martingales, it follows that WiitjVi) and Wi{t,V2) — 
Wi{t,vi), < t < r, are orthogonal square integrable martingales. Hence 
E{wi{T,vi){wi{T,V2) — Wi{T,vi))'^} = 0. So Wa{v) , V G [a, 5], is a vector of 
mean-zero Gaussian random processes with independent increments. 

Further, the covariance matrix of Wa{v) is 

E{Wi{T,v){Wi{T,v))'^} 



E 



E 



A{u) 
Aiu) 



A{u)Ni{dt,du) 



1 02 



: r A{u)E\[ 

Ja UO 



X A{u)y{t\Zi{t))\{t,u\Zi{t))dtdu 
s«(t,/3(n))T®2 



Z^{t) 



A{u)E 
X A{u) du 
A{u)T.{u)A{u)du. 
This completes the proof of Theorem 1. □ 



y{t\Zi{t))\{tMZi{t)) dt 



Proof of Theorem 2. 
conditions of Lemma A.l. 
Let 

n 

r]n{u,9) =n~^Yl 



i=l 



JO 



1=1 



^0 



We shall prove Theorem 2 by verifying the 

[9'^Z,{t) - log(5(°)(t, d))] N,{dt, du), 
[e^Z,{t) - log(s(°)(t, 6))] N,{dt, du), 



Qn{v,d) =n~^l{v,e) +n~^logn / Kh{u - v)N{T,du). 
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Qn{v,e) 



Kh{u - v)7]n{du,0) 

Kh{u - v)in{du, 9) + Op(n-i/2/i-i), 



uniformly in {v,9) G [0,1] x [— M, M], for M > 0. By application of the 
Glivenko-Cantelli and Donsker theorems, similarly to the proofs of Lemma 
A. 2 and Theorem 1, ^niv,9) = S,{v,9) + Op{n~^/'^), uniformly in {v,9) G 
[0,1] X [-M,M], with 



Jo 



[9^Z,{t)-log{s^'>\t,9))]Ni{dt,du) 



It follows that Qn{v,9) = Q{v,9) + 0p{n-^/'^h~^), uniformly in {v,9) G [a, 6] 



-M,M], where 



Q{v,9) = E 



Zi{t) - log(s(°)(t, 9))]\Q{t,v) eMl^^ {v)Zi{t))Yi{t) dt 



Now we show that P{v) is the well-separated point of maximum of Q{v. 
for V E [0,1]. Note that 

kit) 



dQ{v,9)/d9 = E 
d'^Q{v,9)/d9^ = -E 



Xo{t,v)eMP {v)Z,{t))Y,{t)dt 



s(o\t,9) 

{s(o){t,9) ~ W0){t,9)) 

X Xo{t,v)exp{(3'^{v)Zi{t))Yi{t)dt 



We have dQ{v, f3{v))/d9 = 0, and for every e > there exists an r/ > 
such that sup||g_^(„)||>j , ^) < Q{v,(3{v)) — rj iov v £ [a,b], under condi- 

tion (A. 3), by Taylor expansion and continuity. Further, since Qn{v,9) — > 

Q{v,9), dQniv,9)/d9-^dQ{v,9)/d9, and d^Qn{v,9)/d9^ ^ d^Q{v,9)/ 
89"^ uniformly in [v, 9) G [a, h] x [-M, M] , and -M < <M for a<v<b 
for some M < M, we have for every a > there exists an uq such that 
P(-M < P{v) <M,a<v<b)>l-afor n>no. 
Therefore, for every e > 0, 



P[ sup WPiv) - (3{v)\\ > e 

Ka<v<b 



<a + P[ sup \\P{v)- p{v)\\>£,-M <p{v)<M,a<v<b 

\a<v<b 



30 



Y. SUN, P. B. GILBERT AND I. W. MCKEAGUE 



as n — > oo, by the previous checking of the conditions of Lemma A.l together 
with Qn{v, (3{v)) > Qn{v, (3{v)). Since a is arbitrary, we have -P(sup^<^<^ ■ 
P{v)\\>e)^0. □ 

Proof of Theorem 3. In the proof of this theorem, we set j3 = (3{v) 
for simphcity. Note that under Condition A, using a second-order Taylor 
expansion for \{t^u\Zi{t)) in the neighborhood of we have 



n 



-1/2 



E 

1=1 



1 

Jo 



Kh{u - v) 



Zi{t) 



X [\{t,v\Zi{t)) - X{t,u\Zi{t))]dtdu 



uniformly in v G [0, 1] . It follows that 

rl 



r~i JQ Jo 



■ n 



-1/2 



i=l ■ 



E 



JO 

7 

JO 



Kh{u - v) 



Zi{t) 



i=l 



X [Ni{dt,du) -Yi{t)X{t,v\Zi{t))dtdu] 
SWit,(3y 



Khiu - v) 



z^it) 



5(0)(t,/3) 



Mi{dt,du) 



uniformly in u G [0, 1] . 

Next, we show that for each v, n~^^'^h^^'^U{v, converges weakly to 
a normal distribution. By Lemma A. 2, n~^^^M{t,v) converges weakly to 
a mean-zero Gaussian process. By Condition A, \\S^^\t,P) - s^^\t,P)\\ = 
Op(ra-^/^+''), uniformly in t for j = 0, 1, for < (5 < 1/2. Note that n-^/^-f 5/^-1/2 
o(l) for 5 = 1/4 as nh^ ^oo. We have h^^^Khiu- v)\\S'-^\t,(3) - s^^\t,(i)\\ 
goes in probability to zero. Applying Lemma 2 of Gilbert, McKeague and 
Sun [5], we have 



i^^Jo Jo 



Zi{t) 



.(1) 



(16) 



: n 



i^^Jo Jo 



X Mi{dt,du) + Op{n^/'^h^/'^) + Op{l) 



u — v) 



Zi{t) 
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X Mi{dt,du) + Op(n^/2/i^/2) + Op(l) 

= h^''^ r Kh{u - v)Wi{du) + Op(ni/2/i5/2) + 
Jo 

where Wj{v) is defined in (6) with A = I and a = 0. 

Since Wi{v) — > Wi{v) by Theorem 1, by the almost sure representa- 
tion theorem ([16], page 47), there exist Wf{v) and on some prob- 
abihty space that have the same distributions and sample paths as Wi{v) 
and Wi{v), respectively, such that Wi{v) Wi{v) uniformly in S [0, 1]. 
Hence /g Kh{u - v) (du) = Kh{u - v) (du) + Op{n-^/^h-^) by inte- 
gration by parts since K{-) has bounded variation. It follows that 

h^/^ / Kh{u- v)Wi{du) = / Kh{u- v) Wi{du) 
Jo Jo 

= h^/^ Kh{u - v) W^idu) + Op{n'^/^h~^/^). 
Jo 

Since Wf{v) is a Gaussian martingale with covariance matrix of /q du, 
and ji ^^(^ _ y)w^^du) is a mean-zero Gaussian random vector with 
covariance matrix equal to h Kf^{u — v)ll{u) du — > vqT,{v) as /i — > 0. Hence, 
/iV2 Kh{u-v)Wi{du) ^ iV (0, I/O as /i ^ 0, n/i ^ cx). By the Slutsky 
theorem, n-'^/'^h^/^U{v,j3) converges weakly to N{Q,VQTi{v)) as nh? — > oo 
and nh^ — > 0. 

Note that U0{v)) - U{P{v)) = l'^{v, p*{v))0{v) - p{v)), where P*{v) is 
on the line segment between P(v) and f3{v). By Condition A and the uniform 
consistency of (3{v) on v G [a,b] C (0,1), we have n~^l'p{v, P*{v)) = — S(t;) -|- 
Op(l), uniformly in v £ [a, b] for <6 < 1/2. Hence, 

n^/^h^/\P{v) - I3{v)) = -{l'i{v,(3*{v))/nr\-'^/^h^/^U{p{v)) 

(17) 

= (S(^))-in-i/2/,i/2c/(/?(t,))+Op(l), 

uniformly in u € [a, b]. It follows that {nh)'^/'^0{v)- P{v)) N{0, uoT.{vy^) 
as n/i^ — > oo and nh^ — > 0. □ 

Proof of Theorem 4. From (16) and the first line of (17), we have, 
for V £ [a, b] , 

r n^/^{f3{u)-(3{u))du = - /'(^(n))"^ Kh{x - u)Wi{dx) du + Op{l). 

Ja Ja Jo 

Exchanging the order of integration and by the compact support of the 
kernel function K{-) on [—1,1], we have 

r n^''^0{u)- (i{u))du 
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(18) 



v—h 

a+h 
a+h r fv 

a—h 
v+h 

v—h 



{T.{u)y^Kh{x-u) du 
{T.{u))-^Kh{x-u)du 
{i:{u))-^Kh{x-u) du 
{E{u))-^Kh{x-u)du 



Wiidx) + Op{l) 
Wi{dx) 
Wi{dx) 

Wi{dx)+Op{l). 



By Theorem 1, the process Wi{x) converges weakly to a mean-zero Gaussian 
process with continuous paths. Under the assumption (A. 4), {T,{u))~^ Kh{x - 
u)du has bounded variation and converges uniformly to for x £ 

{a + h,v — h). By Lemma 2 of Gilbert, McKeague and Sun [5], the first term 
in (18) is — {Ti{x))~^Wi{dx) + Op(l). Similar arguments lead to the second 
and the third terms in (18) to be Op(l). Hence 



n^/\(3{u) - f3{u)) du 



{j:{x))-'Wi{dx) + Op{l) 



which converges weakly to a p-dimensional mean-zero Gaussian martingale, 
Wy,(^i,)~i{v), with continuous paths. The covariance matrix of VFs(i))-i(^) 
equals to Cov(VFs-i (v)) = S(n)~iS(n)S(n)-i du = ^(u)-^ du. □ 
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